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A Method and Apparatus for Authoring 
of Customizable Multimedia Documents 

This is a continuation of PCT/CA98/00 771 filed August 11, 1998. 
This invention relates to a method and apparatus for the authoring of 

customizable multimedia documents and the adaptive generation of versions thereof 
for particular uses. 



Background of the Invention 

Natural Language Generation (NLG) is a young but growing research field, 
whose goal is to build computer systems that automatically produce fluent and 
effective texts in various human languages. Generally, NLG systems have used 
knowledge databases containing general world knowledge and specific domain 
knowledge, together with various linguistic resources (e.g., lexicons, grammars, 
discourse relations), to produce texts with limited variation in word choice, sentence 
and discourse structure, and virtually no variation in rhetorical style or pragmatic 
puipose. 

While various computational systems have been devised as solutions to the 
problem of producing documents with limited expressiveness in form and effect, none 
has presented a general solution to the problem of representing the kinds of 
knowledge that are needed to produce documents tailored to a specific use or 
audience in a manner that is systematic and extensible, and that further provides for 
the authoring of such documents by a non-computer-programmer professional writer. 
In addition, no system has yet presented a general solution for automatically 
integrating various aspects of document design (e.g., text, graphics, and presentation 
layout) into a single consistent representation format for use within a document 
intended for customization. 

It is well-known from studies in communication that presentation of 
information in a manner that is tailored to the characteristics of a particular audience 
can be significant both in maintaining the interest of the members of the audience and 
in effectively conveying the meaning of the information. For example, in the health 
care industry, it has been shown that infomiation that is tailored to the characteristics 
of an individual patient can have a far greater effect in producing compliance with 
suggested medical regimens as compared to generic information. (Strecher et al 1994 
have done pioneering work in this area). 
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But as Strecher et al's behavioural studies also showed, a sizeable number of 
dilfcrcnt medical and personality factors had to be taken into account in producing 
customized health information that would have the desired effect on the intended 
patient. DiMarco et a! (1995) noted that this kind of customization involves much 
5 more than producing each brochure or leaflet in half a dozen different versions for 
different audiences. Rather, the number of different combinations could easily be in 
the tens of thousands. While not all distinct combinations might need distinct 
customizations, it is nonetheless impossible in general to produce and distribute, in 
advance of need, the large number of different editions of each publication that is 
10 required for individual tailoring of information. 

Thus there is a need for a computer system for the automated production of 
customized material that would tailor a general-purpose ''master document'' for a 
particular purpose or individual on demand. It must also be remembered that in the 
present context the term ''document" is broadly used to define any textual or non- 
15 textual data, including multimedia and hypertext, having inter-relationships between 
the data, and that may be displayable or presented to a human audience in one of 
many presentations and formats. 

As a further example, a master document may refer to the complete superset of 
instructions to direct the actions of a robot on an assembly line. In this instance, there 
20 exists a need to tailor or adaptively generate subsets of combinations of instructions 
for specific robot applications. Wliether the master document is to be customized for 
a particular purpose, as in the robot example, or tailored for a specific audience, as in 
the case of health information, this process of adaptive document generation should 
be easily implementable on a computer system at minimum possible cost and 
25 maximum possible ease of use to both the author of the master document and the user 
of the generation system. 

hi the field of natural language processing, or, computational linguistics, 
various computer systems have been implemented which attempt to produce 
customized documents. In the simplest cases, simple mail-merge techniques are used 
30 which enable ^'personalized" documents to be generated by using hand-coded decision 
rules indicating what information is to be included for various tailoring situations. 
However, these techniques result in very inflexible, and often, awkw^ardly structured, 
and poorly cohesive texts. Other systems utilize schema-based techniques to select 
and organize the content data according to simple document-template structures. But 
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these tcniplalcs arc cillier too general-purpose to provide anything n^orc than very 
eoarse-grained adaptation in the resulting customized texts or too specific to the 
appHcation in question to be appropriate for general use in adaptive generation 
systems. 

5 A number of projects have used more sophisticated techniques from NLG 

research to build adaptive generation systems for both written texts and hypertext 
documents. The IDAS project (Reiter, Mellish, and Levine 1995) recognized the 
need to tailor both textual and non-textual information, including visual formatting, 
hypertext input, and graphics output. IDAS also tried to address the need for explicit 

]() authoring tools in the adaptive document generation process, but here the focus was 
on authoring at the knowledge-base level (i.e., at the level of a computer system's 
internal representation), while there still exists a need to provide an authoring tool that 
may be used by a non-computer-programmer professional writer who could compose 
the master document at the level of ordinary English, with additional markup as 

15 required (e.g., HTML markup to support an HTML presentation format for a resulting 
customized version of the document). IDAS relies mainly on canned texts and aims 
to provide the user with a means of navigating through the whole "hyperspace" of 
possible (caimed) texts. There is however a need to provide for a much finer-grained 
degree of tailoring than the IDAS implementation. 

20 While IDAS relies mainly on canned texts, other adaptive generation systems 

do use more-dynamic text generation: the Migraine system (Carenini, Mittal, and 
Moore 1 994) uses an approach to text planning that adaptively selects and structures 
the information to be given to a particular reader. Flowever, Migraine relies on a 
large number of context-sensitive and user-sensitive ''text plans" (i.e., text schemas) 

25 so that its methods of tailoring must of necessity be very specific to its particular 

domain. The PEBA-Il system (Milosavljevic and Dale 1996) uses more-general text 
plans, as well as text templates, that it can choose from to adapt information to the 
individual reader, but the tailoring done is very specific, focussing on the user's 
familiarity with a topic. The PIGLET system (Cawsey, Binsted, and Jones 1995) also 

30 uses a combination of text plans and text templates, but its tailoring is also quite 

specific in nature, mainly concerned with emphasizing material that is relevant to a 
particular patient. The ILEX-0 system (Knott, Mellish, Oberlander, and O'Donneil 
1996) is similar to the PIGLET model in its anticipation of all the possible texts that 
might be generated, but also includes annotations (e.g., a condition on a piece of 
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canned text) to allow some local customization. However, very free and flexible use 
of annotations could lead to problems of repetitive text and inappropriate use of 
referrii]g expressions in the resulting document, requiring textual repair. 

None of the previous systems provide for a text-repair facility of the kind 
5 described by Hovy and Wanner (1996) and Wanner and Hovy (1 996). The paradigm 
of adaptive document generation by "selection and repair"', as introduced by DiMarco, 
Hirst. Wilkinson, and Wanner (1995), that is, selection of the relevant pieces of 
information from a master document, and then repair of any syntactic or stylistic 
problems in the resulting document by a text-repair facility, is central to the goals of a 

10 customizable document system. However, the system should be able to support either 
an adaptive generation system with full facilities for selecting and repairing texts, as 
described by DiMarco, Hirst, Wilkinson, and Wanner (1995) and Hirst, DiMarco, 
Hovy, and Parsons (1 997), or a simpler version of the system, based on ''generation 
by selection onl> '\ i.e., with no facilities for textual repair, an implementation of 

15 which (called ''WebbeDoc'') is described by DiMarco and Foster (1997). 

In summary, an author of a customizable document needs to be able to 
describe the variations of a document, which may be both textual and non-tcxtual, at 
various levels of the document structure, together with the conditions for selecting 
each variation. 

20 The author then needs a means of selecting all the appropriate variations for a 

paiticular purpose or audience, re-assembling the selected variations into a coherent 
document, and producing an appropriately customized version of the document, in 
potentially many different levels of representation (e.g„ surface English, a deep 
syntactic or semantic representation for use in textual repair) and presentation formats 

25 (e.g., HTML, LaTeX), 

None of the existing adaptive document generation systems has provided a 
generally applicable method and apparatus for describing all the different ways in 
which a document could be customized, or for providing for a non-computer- 
programmer author of a customizable document to specify the possible variations, or 

30 for selecting the appropriate variations and producing a customized version of a 
document. 
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Summai-}' of the Invention 

ll is therefore an object of this invention to provide a system wliich mitigates 
to some extent the above outlined disadvantages. Also, the methods used by the 
present invention are more general than those used in previous systems, allowing not 

5 only the potential inclusion of text plans and schemas, text templates, and canned text, 
but also the dynamic generation of text that can then be subjected to very fme-grained 
revision and tailoring by a text-repair facility. 

This invention seeks to provide a computer system for customizing an initial 
master document containing information for a multiplicity of versions of the 

10 document intended for the different purposes or different users, for a specific purpose 
or for a specific user. 

In accordance with this invention, there is provided a computer system for 
customizing an initial master document in accordance with a user-defined set of 
purpose parameters. In accordance with a further aspect of the invention there is 

15 provided a data structure (i.e., the customizable, "master", document) for specifying 
relationships betw^een elements of the document and between elements of the 
document and their variations. 

A master document will therefore contain all the information that the system 
might need to include in any particular customized version of the document, together 

0 with annotations giving the selection conditions as to when each piece of information 
is relevant and other annotations giving linguistic and formatting information, 
including for multimedia elements of the document. 

A further aspect of the invention provides for a method and apparatus for 
reading said data structure into a form iniplementable on a suitably programmed 

5 processor such that the implementation of the data structure can store both the form 
and content of a master document, i.e., all the elements of the document and their 
variations, and can also act as the process for selecting the relevant variations of the 
document, according to given values of input parameters specifying the intended 
purpose or intended user, and then generating the appropriately customized version of 

f) the document. 



Brief Description of the DraM'ings 
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A better iinderslanding of the invention will be obtained by reference to the 
detailed description of a preferred embodiment below and in regard to the following 
drawings in which: 

Figure 1 is a schematic diagram showing the architecture of the system; 
5 Figure 2 shows a generalised form of a data structure for specifying a customizable, 
master, document according to an embodiment of the present invention; 
Figure 3(a) shows a generalised form of a data substructure for use within tlie main 
data structure for specifying linguistic or presentation format infonnation for a 
component of the master document; 
iO Figure 3(b) shows a generalised form of a data substructure for use within the main 
delta structure for specifying hypertext links to parts of the main data structure or to 
other data structures of the form as specified in Figure 2 and provided in other source 
datafiles; 

Figure 4 is a flowchart of the overall process of reading-in as input the data structure 
15 and generating as output a final customized version of the document; and 

Figure 5 is a graph showing the resolution process for a customized document 
generated according to an embodiment of the present invention; 



Detailed Description of A Preferred Embodiment 

20 Referring to Figure I , an architecture of a customizable document system, the 

^^tailoring engine'\ is shown generally by numeral 10. The system comprises: a 
datafile 12 including a data structure 13 for defining a master document; a parser 14 
for reading in the contents of the datafile 12 and for creating instances of the 
document-class data structures 18 in accordance with the general definitions of 

25 document-class data structures 1 6; a user input interface 20 for reading new values of 
purpose parameters 22 which are input by a selection engine 1 8. The selection engine 
18 uses the current values of the purpose parameters to select the relevant variations 
of each component of the document and to generate appropriately customized 
versions 26 of a document, which may also include hypertext links to new documents 

30 28, which may themselves be customizable documents of the form illustrated in 
Figure 2. 



The term ^'puipose parameter'' as used herein means a parameter used in 
evaluating a selection condition associated with a particular variation of a document 
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structure, where this parameter can be used in defining either a particuhir intended 
purpose or use of a customized version of the document or a particular intended user 
or audience for a customized version. These elements are explained in detail below. 

h urthermore, the datafile may be generated by an authoring tool 30 in 
accordance with the detailed explanation below. 

Referring to figure 2 an embodiment of the data structure 13 according to the 
present invention is described, which shows the blocks, sub-blocks, fields, and 
subfields of the data structure. 

The data structure 13 has the following main blocks of information: 

1 . Identification of purpose parameters and representation-level parameters, 
and their possible values. 

2. Identification of toplevel object (i.e., the main Document). 

3. Definitions of main Document and any subDocuments. 

4 . Definitions of Sections. 

5. Definitions of Topics. 

6. Definitions of Sentences. 

7. Definitions of Lexicals. 

8 . Definitions of Words. 

9. Definitions of Annotations (replaces previous 9. Definitions of Formats). 



20 



10. Definitions of External objects. 



Each of these blocks of the data structure will now^ be described in turn. 



Block 1: The purpose parameters. The first block of the data structure identifies 
25 the purpose parameters, or user parameters, together with their possible values, that 

can be used in forming the Boolean expressions that give the conditions for selecting 

each variation of a document. 

The first block also identifies the representation-level parameters, together 

with their possible values, that can be used in forming the Boolean expressions that 
30 give the conditions for selecting each desired level of representation of the sentences 

in the master document during the process of generating a customized version of the 

document. 
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Block 2: The toplevel object. The toplevel object identifies the document-class 
instance which is the "root" element of the entire document. It is with this object that 
the resolution process for generating a customized version of the document begins. 

5 Blocks 3-9: The program structures. Blocks 3-9 describe the program structures, 
the classes that implement the substructures of the data structure that specify the form 
and content of a customizable document. The program structures are related in the 
following manner: 

A datafile describing a set of program structures is a particular example of a 
iO customizable document created for various uses. The datafile may be divided into 
various parts. 

Firstly, the data structure may be divided into components, or elements, 
refcned to as the classes Document, Section, Topic, Sentence, and Lexical, which 
each implement a substructure of the data structure that defines a component of a 
[5 customizable document. Each such substructure of the data structure also includes the 
variations of a component and the conditions for selecting the appropriate variation of 
a component. 

In addition, the data structure contains basic components which are instances 
of the classes Word and Annotation, and other components, which are instances of the 
20 class ExternaL for linking to other datafiles. 

Block 3: The Documents. 

In Figure 2, Block 3 describes the instances of the class Document. Each Document 
description must specify the following properties: 
25 - A list of its variations. Each variation must be an instance of the class 

DocumentVariation. 

A list of its annotations. Each annotation must be an instance of the class 
Annotation. 

An instance of the class Annotation can specify both textual and non-textual 
30 properties of a document or a component of a document in terms of a 

particular document-layout format, structure, or linguistic representation. For 
example, an Annotation object could specify multimedia elements of the 
document's layout, such as alignment of text, font size, background colour. 
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text colour, and graphics; other Annotation objects could specify linguistic 
information such as discourse relations or coreference links. 
Each variation of a Document class is then described as an instance of the class 
UocunientVariation. Each DocumentVariation description specifies the following 
5 properties: 

- The condition for selecting this variation. The condition must be a 
Boolean expression composed from pairs of purpose parameters and their 
allowable values. 

- A list of the components of this variation. Each component must either be 
10 an instance of the class Document or an instance of the class Section. 

- A list of annotations for this variation. Each annotation must be an 
instance of the class Annotation. 



Block 4: The Sections. Block 4 describes the instances of the class Section. Each 
1 5 Section description specifies the following properties: 

- A list of its variations. Each variation must be an instance of the class 
SectionVariation. 

- A list of its annotations. Each annotation must be an instance of the class 
Annotation. 

20 Each variation of a Section class is then described as an instance of the class 
SectionVariation. Each SectionVariation description specifies the following 
properties: 

- The condition for selecting this variation. The condition must be a 
Boolean expression composed from pairs of purpose parameters and their 

25 allowable values. 

- A list of the components of this variation. Each component must either be 
an instance of the class Section or an instance of the class Topic. 

- A list of the annotations for this variation. Each annotation must be an 
instance of the class Annotation. 

30 

Block 5: The Topics. Block 5 describes the instances of the class Topic. Each 
Topic description specifies the following properties: 

- A list of its variations. Each variation must be an instance of the class 
Topic-Variation. 
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- A list of its annotations. Each annotation must be an instance of the class 
Annotation. 

Each variation of a Topic class is then described as an instance of the class 
TopicVariation. Each TopicVariation description specifies the following properties: 
5 - The condition for selecting this variation. The condition must be a 

Boolean expression composed from pairs of puipose parameters and their 
allowable values. 

- A list of the components of this variation. Each component must either be 
an instance of the class Topic or an instance of the class Sentence. 

10 - A list of the annotations for this variation. Each annotation must be an 

instance of the class Annotation. 

Block 6: The Sentences. Block 6 describes the instances of the class Sentence, 
Each Sentence description must specify the following properties: 
15 - A list of its variations. Each variation must be an instance of the class 

SentenceVariation. 

- A list of its annotations. Each annotation must be an instance of the class 
Annotation, 

Each variation of a Sentence class is then described as an instance of the class 
20 Sentence-Variation. Each SentenceVariation description must specify the following 
properties: 

- The condition for selecting this variation. The condition must be a 
Boolean expression composed from pairs of purpose parameters and their 
allowable values. 

- A list of the components of this variation. 

- A list of the representations of this variation. Each representation must be 
an instance of the class SentenceRepLevel. 

- A list of annotations for this variation. Each annotation must be an 
instance of the class Annotation. 

30 Each SentenceRepLevel description must specify the following properties: 

- The condition for selecting this sentence representation, The condition 
must be a list of one or more representation-level parameters. 

- A list of the components of this variation. Each component must bean 
instance of the class Lexical. 
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A list of the annotations for this variation, 
instance of the class Annotation. 
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Each annotation must be an 



In the current implementation of the system, an instance of the class 
5 SentenceRepLevel may be a character string, with any Lexical components identified 
by surrounding reserved characters. This is a simplification made for ease of testing 
system prototypes, and does not limit the scope of the invention. 



10 Block 7: The Lexicals. Block 7 describes the instances of the class Lexical. Each 
Lexical description must specify the following property: 

A list of its variations. Each variation must be an instance of the class 
LexicalVariation. 

A list of its annotations. Each aimotation must be an instance of the class 
15 Annotation. 

Each variation of a Lexical class is then described as an instance of the class 
LexicalVariation. Each LexicalVariation description must specify the following 
properties: 

The condition for selecting this variation. The condition must be a 
20 Boolean expression composed from pairs of purpose parameters and their 

allowable values. 

A character string associated with this LexicalVariation instance. 

A list of annotations for this variation. Each annotation must be an 

instance of the class Annotation. 
25 - A single component, which may be an instance of any of the classes 

External, Word, Lexical, Sentence, Topic, Section, or Document. Each of 

these cases is dealt with as follows: 
L If a component is a Word, then it is treated as a simple string to be 

concatenated to the result returned. 
30 2. If a component is a Lexical, then it is treated as a set of variations of a 

word, which will be resolved to select the appropriate version of the word. 

In this way, near-synonymy can be handled within the system. 
3. If a component is a Sentence, Topic, or Section, then it is treated as a set 

of 
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10 

Block 8: 

Woid description must specify its associated string and its associated annotations. 
15 Each annotation must be an instance of the class Annotation. 

Block 9: The Annotations. Block 9 describes the instances of the class Annotation 
that will be used to insert all the relevant linguistic and formatting information into 
the customized version of the document to be output. A description of block 9 is not 

20 included in figure 2. Instead, the general structure of an Amiotation class object is 
shown in figure 3(a). 

The Annotation objects can be grouped into several distinct sub-taxonomies, 
one for each type of linguistic or formatting annotations that will be attached to the 
mam mastc! -document data structure. For example, one Annotation sub-taxonomy 

25 might specify details of HTML layout for the overall document and each component 
of the document; another Annotation sub-taxonomy might record properties of the 
discourse structure of the overall document and each component of the document, 
such as rhetorical relations and coreference links. Each Annotation object has a 
property ^'parent" to reference its immediate ancestor in its (sub")taxonomy. 

30 

Block 10: The Externals, Block 10 describes the instances of the class External , 
which will be used to create hypertext links to other customizable documents 
specified in other datafiles. 

Each External description must specify the following attributes: 
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variations of a separate piece of a document, which will be resolved to 
select the appropriate version. 

In this way, adaptive hypertext can be handled within the system. 
If a component is a Document, then it is treated as a whole complete 
document. 

In this way, hypertext links to other documents internal to the same 
datafile can be handled within the system. 

If a component is an External object, then it is treated as a whole complete 
customizable document. 

In this way, hypertext links to other customizable documents can be 
handled within the system. 

The Words. Block 8 describes the instances of the class Word. Each 
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- The name of the file containing the external customizable document. 

- A user profile. The user profile is a list of parameters that describe the 
user or audience for whom the document customization is being 
performed. 

5 The general structure of an External class object is shown in figure 3(b). 

The data structure 13 according to the present invention allows an author to 
describe the structure of a customizable document (i.e., a master document). The data 
structure has a recursive and object-oriented form and can be implemented using an 
object-oriented programming language so that a customizable document described in 
10 the form of the data structure can be implemented as an object-oriented computer 
program. 

The elements of the data structure are related by both part relationships and by 
inheritance relationships, so that the relationships between the elements of a 
customizable document described in the form of the data structure and implemented 
15 as a corresponding object-oriented computer program can be recognized and 

maintained by the object hierarchy and inheritance mechanism of an object-oriented 
computer program. 

Moreover, an object-oriented computer program that implements the data 
structure is both the form and content of a customizable document and the process for 
20 selecting and generating an appropriately customized version of the document. Thus 
the data structure is generic in the sense that it can implement any customizable 
document given in the form of the data structure. 

The data structure 13 describes the corpus text of a specific customizable 
document (i.e., a master document) having elements and structure, as shown in figures 
25 2(a) , 2(b) and 2(c) following. 

The form of the data structure is defined by the set of general class objects 16 
describing the elements and structure of a customizable document in terms of object- 
oriented program structures. These class objects are related by both part relationships 
and by inheritance relationships to be explained below. 

The operation of the system 1 0 (the "tailoring engine") may be explained as 
follows: 

The parser program 14 reads in and parses the master-document datafile 12 to 
recognize its structure and then maps the contents of the input datafile into class 
objects of name according to class names specified in the input data structure. 
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Properties of the classes are also recognized according to information contained in the 
data structure. 

The parser program 1 4 also acts as a document-class instantiator program 
which uses the parsed contents of the input datafile to dynamically create instances of 
5 the program structures identified by the general class objects described above. 
Properties of the classes are also dynamically assigned according to information 
contained in the data structure. 

An integration of data structures with the main process of the system 
comprises the following: 
»0 The instances of the class objects dynamically generated from the input 

datafile, winch provide two simultaneous functions: 

1 . The program data structures describing the elements, structure, and content 
of the master document. 

2. The selection process for generating the appropriate customized version of 
15 the document. 

Thus, a feature of the system is that given the current values of the purpose 
parameters, the instances of the program data structures, that is to say, the instances of 
the class objects describing the elements of the master document, execute themselves 
to select and generate the appropriate customized version of the document. 
20 The core of the system, that is, the integration of program data structures with 

the selection process, is generic in terms of the following properties: 

Application-independent. The system core is independent of the application: 
the only items that need be re-defined for a new application are the input datafile and 
the interface for reading the current values of the purpose parameters. This is 
25 discussed further below. 

Platform-independent. The system core is currently implemented in the Java 
programming language, so is platform-independent to the same extent as Java itself 

Processor-independent. The system core is currently implemented in the 
Java programming language, but is independent of the underlying programming 
30 language, i.e., processor, to the extent that the programming language used must 

provide an object-oriented paradigm and a semantics for property inheritance that is 
consistent with the specification of the resolution process used in the system for 
generating a customized version of a document from the instances of the program data 
structures. 
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1 lie process of generating a customized version of a document from the 
instances of the program structures is referred to as resolution and will be discussed 
later with reference to Figure 5. 

A customized version of a document can be generated in any number of 
5 different levels of representation of its content (e.g., surface English; a syntactic or 
semantic representation to be used by a text-repair facility; and so on). The different 
representations to be generated for any given application must be indicated by the 
representation-level parameters in the description of the document given by the data 
structure in the input datafile. This information is specified in block 1 of the data 

10 structure as described with reference to figures 2(a), 2(b), and 2(c) above. 

Each different representation of the content of a customized version of the 
document wall be generated along with a list of all the relevant annotations to the 
content. These annotations provide information on the multiple fonns in which the 
document ma>' subsequently be presented to a reader, and the linguistic information 

15 that may be used to guide subsequent repair of the customized document. The 

customized document that is generated will also include ail annotations to the content 
concerning the External objects that can be used to provide hypertext links to other 
customizable documents or to other applications of the system 10 (the ''tailoring 
engine") from figure 1. 

20 The process of generating a customized version of a master document is 

shown in Figure 4. lliere are two main stages to this process, the initial setup and the 
main program loop. In the initial setup, the input datafile is read in and parsed, and 
the appropriate instances of the various document classes described above are created. 
The parser program 14 reads in the input datafile 12, which contains the data structure 

25 1 3 giving the specification of a customizable document. As it reads in the file, the 
driver program also acts as a class instantiator to create instances of all relevant 
document classes according to the input data structure. Program links, i.e., references, 
are created between these class instances via the setting of their properties and 
assignment of their property values, 

30 In each iteration of the main program loop, new values of the purpose 

parameters are read in, and a customized version of the document is generated as 
output for each specified level of representation. A user interface in the form of a 
reader program 20 obtains the new values of the purpose parameters. In the latter 
instance, the parameter values may be entered interactively or may be read in from 
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previously compiled profiles of user preferences stored in computer databases. This 
allows for the mass customization of information for high volumes of individual users 
with diverse characteristics, such as in the mass production of personalised financial 
investment advice. 

5 A selection engine 1 8 resolves the document instances created in the setup 

stage according to the current values of the purpose parameters to generate the 
appropriately customized version of the document. The customized document is 
output in all specified levels of representation with all relevant linguistic and 
formatting information attached to each component of the document If there arc no 
10 more new purpose-parameter values to read in, the main program loop terminates. 

The process of selecting the appropriate variation of each document structure 
is called "resolution". Figure 5 is a graph showing the resolution process for a 
customized document generated according to an embodiment of the present invention. 
The pseudocode for the Resolve procedures, which implement the resolution process, 
15 is as follows: 

Procedure: Resolve (for VariationContainer classes) 
Parameters 

- (input) 

An instance of WorkingCondition 
20 (a list of purpose parameters and their current values) 

- (input) 

An instance of List 

(a list of the desired representation levels to output) 

- (output) 

25 An instance of DocumentObjectSet, 

i.e.. a set of references to resolved DocumentObjects, 

with one set member for each desired level of representation 

of this VariationContainer instance 

30 Algorithm: 

if Resolve has already been called wdth this WorkingCondition then 

result is the previously saved result 
else 

call Satisfies on each variation of this Variation Container until 
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the WorkingCondition is satisfied 
call Resolve on this satisfying variation to 

return a DocumentObjectSet 
add the appropriate annotations for this VarialionContainer into 
5 the returned DocumentObjectSet 

end if 

if no variation satisfies the WorkingCondition then 
result is null 

else 

10 result is the annotated DocumentObjectSet for this VariationContainer 

end if 



Procedure: Resolve (for Variation classes) 
Parameters: 
1 5 - (input) 

An instance of WorkingCondition 
- (input) 
An instance of List 

(a list of the desired representation levels to output) 
20 - (output) 

An instance of DocumentObjectSet, 
i.e., a set of references to resolved DocumentObjects, 
with one set member for each desired level of representation 
of this Variation instance 

25 

Algorithm: 

create a new DocumentObjectSet for this Variation 
for each component of this Variation do 
call Resolve on the current component to 
30 return a DocumentObjectSet for this component 

add the appropriate annotations for this Variation into 

the returned DocumentObjectSet 
attach the annotated DocumentObjectSet 

as a child of the DocumentObjectSet for this Variation 
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end for 

result is the annotated DocumentObjectSet for this Variation 

The '^resolution" of the appropriate annotations for a given input data structure is 
5 handled as follows: 

Each instance of a document class in the input data structure, down to the level 
of a Word instance, has an ''annotations" property, which may be null, associated with 
it. This property is a list of all the Annotation objects that apply to this document-class 
instance and must contain one Annotation object for each of the distinct sub- 
10 taxonomies in the overall class of Annotation objects. 

When a document-class instance is being resolved, its set of Annotation 
objects, as specified in its ^^annotations'' property, will be collected and included in 
the content of the customized document. 

In summary, an application system which wishes to put the invention into 
1 5 practice tnust execute the following steps: 

1 . An input datafile of the structure as described with reference to figures 
2(a), 2(bj, and 2(c) is created; 

2. A user interface in the form of a reader program to obtain new values of 
purpose parameters is provided. This reader program returns as output an 

20 instance of WorkingCondition which is a set of purpose parameters and 

their values (implemented as a hashtable in one embodiment of the 
invention). 

3. The main program first reads in the datafile, then calls the reader program 
to obtain a new instance of WorkingCondition. The main program will 
^hen start the resolution process for the toplevel Document object by 
passing it the WorkingCondition. 

4. The output of each iteration of the main program will be a customized 
version of the document in all the levels of representation specified in the 
input datafile using the representation-level parameters. Each 
representation of the customized document is output for possible later 
processing by the application system. 
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Siiinple Data Structures 

Selections from two sample data structures are presented in this section. The 
first example is for the customizable home page of the HealthDoc Project at the 
5 University of Waterloo (Waterloo, Canada). The second example is for a master 
document giving basic health information on diabetes. 

Note that in the Lexical Variation labelled "lexDiabetesMaster-a" in Example 1, there 
is 

iO a link to another completely separate customizable document, contained in a different 
input datafiie. This link is made through an instance of the External class. 

Example 1: A Customizable Web Page 

15 

// The parameters 
PurposeParameters 
|role==CLexpert physician layperson funder& 
technical =high \o\w& 
20 agc^senior adult child& 

formality^formal informal& 
coolness=cool bland| 

RepresentationLevelParameters 
25 |levels^english| 

// The top-level object 
loplevel=Document.vvebbedoc 

30 // The Documents and DocumentVariations 
Document webbedoc 
|title=-"The HealthDoc Project Home Page''& 
variations=^doc-a doc-b doc-c doc-d doc-e doc-f doc-g doc-h& 
annotations=html-doc-webbedoc-toplevel| 

35 

DocumentVariation doc-a 
|condition=(and (coolness cool) (age aduU) (role CLexpert))& 
componentList^Section.secl Section.sec2 Section. sec3 Section. sec4 
Section.sec5 Section.sec6 Section. sec7& 
40 annolations=html-doc-webbedoc-doc-a| 

DocumentVariation doc-d 
|condition=(and (coolness bland) (age adult))& 
componentList=Section.secl Section. sec2 Section. sec3 Section. sec4 
45 Section. sec5 Section. sec6 Section.sec7& 

annotations=^html-doc-webbedoc-doc-d| 

// The Sections and SectionVariations 
Section seel 

50 |variations=secla sec lb seclc secld secle seclf 
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seclg seclh secli seclj sec Ik sec 11 
seclni sec In seclo seclp seclq seclr& 
annotatioiis=html-scc-vvcbbcdoc-sccl| 

5 Section Variation sec 1 a 
Icondition- 

(ancl (role funder){teclinical all)(coolness biand)(formality fonnal))& 
coniponentList=Section.subsec 1 - 1 Section.subsec 1 -2& 
annotations=html-sec-\vebbedoc-sec 1 a| 

10 

Section subsecl-1 
I variations=subsec 1 - 1 a& 
annotations^html-scc-webbedoc-subsec I - 1 1 

1 5 Section Variation subsec 1 - 1 a 
|condition=()& 

coinponenlList=Topic.topic 1 & 
annotations^lUml-sec-webbedoc-subsec 1 - 1 1 

20 Section sec2 

|variations=sec2a sec2b& 
annotations=html-sec-webbedoc-sec2| 

Section subsec2-] 
25 I variations=subsec2- 1 a& 

annotations^html-sec-\vebbedoc-subsec2- 1 1 

SectionVariation siibsec2-la 
|condition=()& 

30 componentList=Topic.topic4 Topic.topicS Topic.topic6 Topic.topic7& 
annotations=html-sec- webbedoc-subsec2- 1 1 

// The Topics and TopicVariations 
Topic iopic4 
35 |variations=topic4a& 

annotations=html-topic-webbedoc-default| 

1 opicVariation topic4a 
|condition=()& 

40 componentList=Sentence.sent4a- 1 Sentence.sent4a-2& 
annotations=html-topic-webbedoc-default| 

Topic topics 

A5 |\'ariations=lopic5a topicSb topicSc topic5d& 
annotations^htnil-topic-webbedoc-default| 

TopicVariation topic5a 
|condition={and (role physician) (technical high))& 
50 componentList=Sentence.sent5a- 1 & 
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annotalions^lunil-lopic-vvebbedoc-defaultj 

TopicYariation topicSc 
tcondilion=(and (not (role physician)) (technical high))& 
s c()!npi)nentList=Sentcnce.sent5c-l& 

annotalions=htnil-topic-webbedoc-defauIt| 

'lopic topic? 
|variaiions=topic7a topic7b& 
10 annotations=hlm]-topic-webbedoc-default| 

TopicVarialion topic7a 
|condition=( technical low)& 
coniponentList=Sentence.sent7a- 1 & 
1 5 annotalions==html-topic-\vebbedoc-default| 

TopicVarialion topic7b 
|condiiion=(lechnical high)& 
componentLisi^Sentence.sent7b- 1 & 
20 annoiations=htm]-topic-webbedoc-default| 

Topic topic-compliance 
I vai iations=topic-compliance-a& 
annotaiions=himil-topic-webbedoc-default| 

25 

TopicYariation topic-compliance-a 
|condition=()& 

componentList=Sentence.sent-compliance-l Sentence. sent-compliance-2 
Sentence. sent-compliance-3 Sentence. sent-compliance-4& 
30 annotations=html-topic-webbedoc-defauit| 

// The Sentences and SentenceVariations 
Sentence sent4a-l 
35 |variations=scnt4a- 1 i& 

annotations=htnil-sent-webbedoc-default| 

SentenceYariation senl4a-li 
|condition=()& 
40 componentList=()& 

levelList^SentenceRepLevel.sent4a- 1 i-english& 
annotations=html-sent-webbedoc-default| 

SentenceRepLevel sent4a- 1 i-english 
45 |repLevcl==english& 

componentList=''Why do we want to be able to produce tailored 

documents?"& 
annotations=html-sent-webbedoc-default| 

50 
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Sentence senl4a-2 
|variations=sent4a-2i& 
annotations^html-senl-vvebbedoc-default| 

5 SentenceVariation sent4a-2i 
|conciilion=()& 
componenlList=()& 

level Li st=SentenceRepLeveI.sent4a-2i-english& 
annotalions=htmi-sent-webbedoc-default| 

K) 

SentenceRcpLevel sent4a-2i-englisli 
|repLevel=engIish& 

componentList^"Bccause research in communication has shown that 
people pay more attention to messages that are aimed 
15 just at them.''& 

annotations=html-sent-webbedoc-default| 

Sentence sent5a-l 
|variations=sent5a- 1 i& 
20 annotations=htnil-sent-webbedoc-default| 

SentenceVariation sent5a-li 
|condition^()& 
coniponentList=()& 
25 levclList=SentenceRepLeveLsent5a-li-engIish& 
annotations=html-sent-webbedoc-default| 

SentenceRcpLevel sent5a-l i-english 
|repLevei-english& 

30 componentList=''Studies have shown that health information that is tailored 
to a patient's specific medical condition and personal characteristics 
is much more effective than generic information in influencing 
^lexCompliance'' and subsequent outcome. 
annotations=html-sent-webbedoc-default| 

35 

Sentence scnt7a-l 
[variations^ sent7a-li& 
annotations=html-sent-webbedoc-default| 

40 Sentence V ariation sent7a- 1 i 
|condition=()& 
componentList=()& 

icvelList==SentenceRcpLevcl.scnt7a- 1 i-cnglishtS: 
annotations=html-sent-webbedoc-default| 

45 



SentenceRcpLevel sent7a- 1 i-english 
50 |repLevel=english& 
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coniponentList^'^But it can be very difficult to write and keep track 

of many versions of the same '^lexSynonymsl^."& 
annotations=html-sent-\vebbedoc-default| 

5 Sentence sent8a-l 

I varialions=sent8a- 1 i& 
annotalions=html-sent-webbedoc-default| 

Sentence Variation sent8a-li 
10 |condition=()& 

componentList=()& 

levelList^SentenceRepLeveLsentSa- 1 i-english& 
annoialions=html-sent-vvebbedoc-defaiilt| 

i 5 SentcnceRepLevel sentSa- 1 i-english 
|repLevel=english& 

coniponenlList^^'What is needed is a computer system for the production 
of tailored health-information and patient-education 
documents, that would, on demand, customize a 'master 
20 document' to the needs of a particular individual."& 

annotations=html-sent-webbedoc-default| 



Sentence sent8a-2 
25 |variations=sent8a-2i& 

annotations=html-sent-webbedoc-default| 

SentenceVariation sent8a-2i 
|condition=()& 
30 componentList=()& 

levelList^SentenceRepLevel.sent8a-2i-english& 
annotations=^html-sent-webbedoc-defaultl 

SentenceRepLevel sent8a-2i-english 
35 |repLevel=english& 

componentList=='The HealthDoc project has currently built the first 

"^lexDiabetesMaster"^ of such a system. "& 
annotations=html-sent-webbedoc-default| 

40 Sentence sent-compliance-1 

|variations=sent-compliance-la& 
annotations=html-sent-webbedoc-default| 



45 

SentenceVariation sent-compIiance-1 a 
|candition=()& 
componentList=()& 

levelList=SentenceRepLevel.sent-compliance- 1 a-english& 
50 annotations=^html-sent-webbedoc-default| 
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ScnlenceRcpLevel sent-compliance-la-english 
jrcpl .cvcl^cnglislu^ 

coinponcntList=="Recenl experiments have shown thai health-education 
3 material can be much more effective if it is 

cusloiuizcd for the individual reader in accordance 
with their medical conditions, demographic variables, 
personality profile, or other relevant factors."& 
annotations=htm]-sent-webbedoc-default| 

10 

Sentence sent-compliance-2 
|variations=sent-compiiance-2a& 
annotations=html-scnt-webbedoc-defaultl 

1 5 SentenceVariation sent-compliance-2a 
|condition=()& 
componentList=()& 

]evelList--SentenceRepLeveLsent-compliance-2a-english& 
annotations=htm]-sent-webbedoc-default| 

20 

ScnienceRepLevel sent-compliance-2a-engHsh 
(repLevcl=english& 

coniponentList=*Tor example, Dr Victor Strecher (now at the 
Comprehensive Cancer Center of the University of 
^5 Michigan) and colleagues sent unsolicited leaflets to 

patients of family practices on topics such as giving 
up smoking, improving dietary behaviour, or having a 
mammogram. 
annotations=html-sent-webbedoc-default| 

30 

Sentence scnt-compiiance-3 
|variations=sent-compliance-3a& 
annotations==html-sent-webbedoc-default| 

SentenceVariation sent-compliance~3a 
|condition=()& 
componentList=()& 

levelList=SentenceRepLeveI.sent-compliance-3a-english& 
annotations-html-sent-webbedoc-defaultj 

40 

SentenceRepLevel sent-compliance-3a-english 
|repLevel=english& 

componentList- 'Each leaflet was 'tailored' to the recipient, on the 
basis of data gathered from them in an earlier survey. "& 
45 annotations=html-sent-webbedoc-default| 

Sentence sent-compliance-4 
|variations=sent-compliance-4a& 
annotations=html-sent-webbedoc-default| 

50 
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Sentence Variation scnt-compliance-4a 
lconciition^()& 
componenlList=()& 

levelList=SentenceRepLeveLsent-compliance-4a-english& 
5 annolalions=html-scnt-webbedoc-default| 

SentenceRepLevel sent-compliance-4a-english 
|repLevel=english& 

componentList='*In each study, the 'tailored' leaflets were found to 
10 have a significantly greater effect on the patients' 

behaviour than 'generic' leaflets had upon patients 
in a control group."& 
annotations=html-sent-webbedoc-default| 

15 

// The Lexicals and LexicalVariations 
Lexical lexSynonymsl 
|variations=lexSynonymsla lexSynonymsl b& 
20 annotations=| 

Lexical Variation lexSynonymsl a 
|condition^(role physician)& 
string=''brochure"& 
25 value=Word.biochure& 
annotations^! 

Lexical Variation lexSynonymslb 
|condition=(not (role physician))& 
30 string=''informaiion"<S: 

value- Word. information& 
armolations=| 

Lexical lexCompliance 
35 |variations=lexCompIiance-a& 
annotations^! 

1 >exical Variation lexCompliance-a 
|condition=(role physician)& 
40 string="tailored health-information"& 
value=Topic.topic-compliance& 
annotations^! 

Lexical lexDiabetesMaster 
45 |variations=^lexDiabetesMaster-a& 
annotations^^! 



50 
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Lexical Variation lexDiabetesMaster-a 
|condition"()& 
string~''pmlotype"& 
value=Exlernal.exlernal-diabeteS(& 
5 annotations=| 

// The Words 
Word brochure 
|value^"brochure''| 

10 

Word information 
|value="information"| 

// '1 he Annotations 
15 Annotation html-doc-vvebbedoc-toplevel 
title='The HealthDoc Home Page"| 

Annotation html-doc-webbedoc-doc-a 
[background- 'blue2.gir'& 
20 bgcolor="#ffffer& 
vHnk='#990099"& 
alink='^#990099"& 
link="#990099"& 

title-begin="<center><font size - +4 color - V'990099\">''& 
25 title"end="</font></center>"& 

image-dip='V~healthdo/images/"& 

image-align=''alt"& 

parent=htmI-doc-\vebbedoc-toplevel| 

30 Annotation html-sec-webbedoc-secl 

ltit!e-begin=^^<font size - +1 color = \"990099V'>"& 
title-end-'^</font>"& 

title='The goal of the HealthDoc project"& 
parent^html-doc-webbedoc-toplevel| 

35 

Annotation html-sec-webbedoc-secl a 
iimage="businmen.jpg''& 
parent=html-sec-webbedoc-sec 1 1 

40 Annotation html-sec-webbedoc-sec2 

|title-begin="<font size = +1 color = \"990099\">"& 

title-end-"</font>"& 

title=''The motivation for the research''& 

parent=^html-doc-webbedoc-toplevel| 

45 

// The External objects 
External external-diabetes 
|rileName="diabetes.master"& 

profile=<list of parameters describing current user/audience>| 

50 
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Example 2: Customizable Health Information 

// The parameters 
5 PiirposeParameters 

I iype=insulin-dependent non-insulin-dependenl& 
technical=high-technical moderate-technical low-technical& 
agc=senior adult young-adult child& 
locus-of-conlrol=doctor patient| 

10 

RepresentationLevelParameters 
lenglish spl| 

// The top-level object 
] 5 topleveI=Document.diabetes 

// riie Documents and DocumentVariations 
Document diabetes 
Ititle- Treating Your Diabetes"& 
20 variations^doc-a<S: 

annotations=htm!-doc-diabetes-toplevel discourse-doc-diabetes-toplevel| 

DocumentVariation doc-a 
|condition=()& 

25 componentList-Section.secl Section. sec2 Section. sec3 Section. sec4& 
annotations=htmI-doc-diabetes-doc-a discourse-doc-diabetes-toplevel| 

// The Sections and SectionVariations 
Section seel 
30 I variations=sec 1 a& 

annotations^html-sec-diabetes-secl discourse-sec-diabetes-sec 1 1 

SectionVariation sec la 
|condition=()& 

35 componentList^Section.subsecl-1 Section,subsecl-2& 

annotations=html-sec-diabetes-sec 1 a discourse-sec-diabeles-sec 1 1 

Section subsecl-1 
I variations==subsec 1 - 1 a& 
40 annotations=html-sec-diabetes-subsec 1 - 1 discourse-sec-diabetes-sec 1 1 

SectionVariation subsecl-1 a 
|condition=()& 

componentList=Topic.topicl & 
45 annotalions=html-sec-diabetes-subsec 1 - 1 a discourse-sec-diabetes-sec 1 1 

Section subsecl-2 
I variations=subsec 1 -2a& 

annotations=^html-sec-diabetes-subsec 1 -2 discourse-sec-diabetes-sec 1 1 

50 
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Section Variation subsecl-2a 
|condilion=()& 

c()rnp(>ncn(Lisl-'I'opic.lopic2 Topic. topic3 Topic. topic4 
Topic. topics Topic.topic6& 
5 annotations=html-sec-diabetes-subsec I -2a discourse-sec-diabeles-sec 1 1 

// The Topics and TopicVariations 
Topic topic] 
1 varialions=topic 1 a& 
1 0 annotations=hlml-topic-diabetes-default discourse-topic-diabetes-topic 1 1 

Topic Variation topic la 
|condilion=()& 

componcntList^Sentence.sent 1 a- 1 Sentence. sent I a-2& 
1 5 annotations=html-topic-diabetes-default discourse-topic-diabetes-topic 1 1 

Topic topic3 
|varialions==lopic3a topic3b& 

annotations=html-topic-diabetes-default discourse-topic-diabetes-topic3| 

20 

TopicVariation lopic3a 
|condition=(type insulin-dependent)& 
componentList=^Sentence.sent3a-l& 

annotations=html-topic-diabetes-default discourse-topic-diabetes-topic3| 

25 

TopicVariation topic3b 
|condition=(type non-insulin-dependent)& 
componentList=Sentencc.sent3b-l& 

annotations=html-topic-diabetes-default discourse-topic-diabetes-topic3 

30 

// The Sentences and SentenceVariations 
Sentence sentla-1 
I variations=^sent 1 a- 1 i& 

annotations=html-sent-diabetes-default discourse-sent-diabetes-default| 

35 

Sentence Variation sentla-li 
|condition=()& 
componentList=()<S: 

level List^SentenceRepLevel. sent 1 a- 1 i-english SentenceRepLevel .sent 1 a- 1 i-spl& 
40 annotalions=htnil-sent-vvebbedoc-default| 



SentenceRepLevel sent 1 a-] i-engiish 
|repLevel=english& 

45 componentList^LexicaLlexDiab LexicaLlexis Lexical. lexa Lexical.lexgroup 
Lexical. lexof Lexical. lexconds Lexical. lexin Lexical. lexwhich 
Lexical. lexglucose Lexical. lexlevels Lexical. lexare 
Lexical. lexabnormally Lexical. lexhigh.& 
annotations=html-sent-diabetes-default discourse-sent-diabetes-default| 
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ScnienccRepLcvcl sent 1 a- 1 i-spl 
|repLevcl=spl& 

componentList='^<a Sentence Flan Language (SPL) form>''& 
annotations=hlml-sent-diabetes-default discourse-sent-diabetes-defaultl 

5 

Sentence sent3a-l 
|variations=sent3a-li& 

annotalions=html-sent-diabetes-default discourse-sent-diabetes-default| 

10 SentenceVariation sent3a-li 
|condition=()& 
componentList=()& 

levelList=SentenceRepLevel.sent3a-li-english SentenceRepLevel.scnl3a-l i-spl& 
annotations=html-sent'\vebbedoc-default| 

15 

SentenceRepLevel sent3a- 1 i-english 
|repLevel=english& 

coniponentList^'The condition that you have is insuHn-dependent diabetes. 
annotations=htmI-sent-diabetes-defauh discourse-sent-diabetes-defauhi 

20 

SentenceRepLevel sent3a-l i-spl 
|repLevel=spl& 

componentList^"(asc / ascription 
riense present 
25 :domain (condl / abstraction 

:lex condition 
[determiner the 
:process (have / ownership 
:lex have-possession 
30 :tense present 

:domain (hearer / person) 
: range cond)) 
:range (diab2 / abstraction 
:lex diabetes 
35 [determiner zero 

:property-ascription (ins / quality 

:lex insulin-dependent))y'& 
annotations=html-sent-diabetes-default discourse-sent-diabetes-default| 
// The Lexicals and LexicalVariations 
40 Lexical lexDiab 

|variations=lexDiab-a& 
annotations=| 

LexicalVariation lexDiab-a 
45 |condition-()<& 

string=''Diabetes"& 
value=Word.Diab& 
annolat!ons=| 



31 



wo 99/08205 



PCT/CA98/00771 



Lexical lexglucose 
|variations=lexglucose-a& 
annotnlions-| 

5 LexicalVariation lexglucose-a 
|condition=()| 
stnng=''glucose"& 
valuc=^Word.glucose& 
annotations^! 

10 

Lexical lexhigh 
|variations=lexhigh-a& 
annolalions^l 

15 LexicalVariation lexhigh-a 
|condition=()| 
slring=''high''& 
value=Word.high& 
annotations^! 

20 

// The Words 
Word Diab 
!value— "Diabetes"&| 

25 Wold glucose 

|value==''glucose''! 

Woid high 
|value=-'iiigh"&| 

30 



// The Annotations 

35 // // HTML Annotations 

Annotation html-doc-diabetes-loplevel 
Ititle- 'About Your Diabetes"! 

Annotation htnil-doc-diabetes-doc-a 
40 ibgcoloi-"#ffffer'& 

title-begin-"<hl aIign-V^center\">^'&title-end= '</hl>"& 
parent=htm]-doc-diabetes-toplevel| 

Annotation html-sec-diabetes-defauh 
45 // <default HTML markup for any Section in diabetes document>| 

Annotation html-sec-diabetes-sec 1 
|titIe="Basic information"& 
parent=html-sec-diabetes-default! 

50 
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Aiinolalion hlinl-scc-diabetes-sccla 
|title-begin=^^<h2 align=\"centerV'>"&title-end-"</h2>"& 
parent^html-sec-diabetes-sec 1 1 

5 Annotation html-scc-diabetes-subsecl -1 
[title- ^Whal is diabetes?"& 
parent=html-sec-diabetes-secl I 

Annotation html-sec-diabetes-subsec 1 - 1 a 
10 |section-end=^^<p>"& 

title-begin="<h3>"&title-end="</h3>"& 
parent=htnil-sec-diabeles-subsec 1 - 1 1 

Annotation htnil-sec-diabetes-subsecl -2 
1 5 |title=="The two types of diabetes"& 
parent=hlmi-sec-diabetes-sec 1 1 

Annotation html-sec-diabetes-subsec 1 -2a 
|scction-end=--<p>"& 
20 title-bcgin-"<h3>"&title-end- '</h3>"& 
parent^htnil-sec-diabetes-subsecl-2| 

// // Linguistic Annotations 
Annotation discourse-doc-diabetes-toplevel 
25 |title-"About Your Diabetes"| 

Annotation discourse-sec-diabetes-sec 1 
|title=^''Basic inronnation"& 
relations=^''((o^d < (topic2a topic3a)) 
30 (ord < (topic2a topic3b)) 

(ord < (topic2a topic4a)) 

(ord < (topic2a topic5a)) 

(ord < (topic2a topic5b)) 

(ord < (topic2a topicSc)) 
35 (ord < (topic2a topic5d)) 

(ord < (topic2a topic6a)) 

(ord < (topic2a topic6b)) 

(ord < (topic3a topic6a)) 

(ord < (topic3b topic6b)) 
40 (elaboration topic4a topic3b) 

(justification topic2a topic3a) 

(justification topic2a topic3b) 

(^justification topic2a topic4a) 

(elaboration topic4a topic2a) 
45 (elaboration topicSa topic3a) 

(elaboration topicSb topic3b) 

(elaboration topicSc topic3a) 

(elaboration topicSd topic3b) 

(Justification topic3a topic6a) 
50 (justification topic3b topic6by'| 
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Annotation discoursc-sec-diabctes-subsec 1 - 1 
|tillc-"What is diabclcs?^V^ 
parcnt=discourse-sec-diabetes-sec 1 1 

5 

Annotation discourse-sec-diabetes-subsccl -2 
|litlc^''Tbe two types of diabetes"& 
parent=discourse-sec-diabetes-secl| 

0 Annotation discourse-topic-diabetes-topic3 
|corefs='X(condl specific cond) 
(diab2 specific diab) 
(condl generic diab2))''<S: 
parent=discourse-sec-diabetes-sec 1 1 
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I'hc program objects are: 

Basic(Jbject. 
5 RcsolvableObject, 

DocumentObjecl. 

DociimenlObjectSel. 

VariationContainer, 

Variation. 
10 Condition. 

WorkingCondition. 

Annotation. 

External. 

15 

1 he Procedures 

Procedures used by BasicObject classes 

These procedures are used within BasicObject classes (e.g., BasicDocument, 
20 BasicSection, etc.) 

Procedure: SetPropcrties 

Parameter: 

A list of property names and their corresponding values 

25 

Algorithm: 
for each property in the list do 
set its value 

30 Procedures used by VariationContainer classes 

These procedures are used within VariationContainer classes (e.g., Document, 
Section, Topic), which have a list of variations. VariationContainer classes are 
extensions of ResolvableObject and therefore have a Resolve method. 
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P roced u re : Sc tP ropcrt i es 

l^aiameter: 

A list of property names and their corresponding values 
Algorithm: 
5 for each property in the list do 

if property is a list of variations then 

set the class property "variations'' to given value 
else if property is a list of annotations then 
set the class property "annotations'' to given value 
10 else if property is defined for this class then 

set its value 
else signal an error 
end if 
end for 

15 

Procedure: Resolve (for VariationContainer classes) 
Parameters 

- (input) 

An instance of WorkingCondition 
20 (a list of purpose parameters and their current values) 

- (input) 

An instance of List 

(a list of the desired representation levels to output) 

- (output) 

25 An instance of DocumentObjectSet, 

i.e., a set of references to resolved DocumentObjects, 
with one set member for each desired level of representation 
of this VariationContainer instance 
Algorithm: 

30 if Resolve has already been called with this WorkingCondition then 

result is the previously saved result 
else 

call Satisfies on each variation of this Variation Container until 
the WorkingCondition is satisfied 
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call Resolve on this satisfying variation to 

return a DocumentObjectSet 
add the appropriate annotations for this VariationContainer into 
the returned DocumentObjectSet 
5 end if 

if no variation satisfies the WorkingCondition then 
result is null 

else 

result is the annotated DocumentObjectSet for this VariationContainer 
10 end if 



Procedures used by Variation classes 

These procedures are used v^ithin Variation classes (e.g., DocumentVariation, 
SeciionVariation), which therefore have a selection condition. 
15 Procedure: SetProperties 

Parameters: A list of property names and their corresponding values 
Algorithm: 

for each property in the list do 
if property is a condition then 
20 set the class property "condition" to given value 

else if property is a list of annotations then 

set the class property ''annotations" to given value 
else if property is a list of components then 
set the class property ''componentList" to given value 

25 

else if property is defined for this class then 

set its value 
else signal an error 
end if 

30 end for 
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Procedure: Satisfies 

Parameters: 
- (input) 

An instance of WorkingCondition 
5 - (output) 

A Boolean value 
Algorithm: 

if this variation satisfies the WorkingCondition then 

result is true 
10 else 

result is false 
end if 



Procedure: Resolve (for Variation classes) 
15 Parameters: 

- (input) 

An instance of WorkingCondition 

- (input) 

An instance of List 
20 (a list of the desired representation levels to output) 

- (output) 

An instance of DocumentObjectSet, 
i.e., a set of references to resolved DocumentObjects, 
with one set member for each desired level of representation 
25 of this Variation instance 

Algorithm: 

create a new DocumentObjectSet for this Variation 
for each component of this Variation do 
call Resolve on the current component to 
30 return a DocumentObjectSet for this component 

add the appropriate annotations for this Variation into 

the returned DocumentObjectSet 
attach the annotated DocumentObjectSet 

as a child of the DocumentObjectSet for this Variation 
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end ibr 

result is the annotated DocumentObjectSet for this Variation 



1 ()|)level Procedures 

5 These are the toplevel procedures used to read in a datafile containing 

a master document, create instances of the document-class objects, then loop lo read 
in new values of the purpose parameters and generate an appropriately customized 
version of the document, with all appropriate annotations, at each level of 
representation as specified by the representation-level parameters given in the 
10 datafile. 



Procedure: Parse 

Parameter: 
Name of datafile to be read in 

Algorithm: 
while end-of-file has not been reached 
read in the next line of the datafile 



20 if the line is a comment or a blank line then 

skip over it 

else if the line specifies the toplevel DocumentObject then 

set the "toplevel" variable to reference this object 
else if the line specifies the purpose parameters and their values then 
25 set the '"purposeParameters" variable 

else if the line specifies the possible representation levels then 

set the ''repLevels" variable 
else instantiate the specified document object 

(i.e., create a new executable instance of this document-object class, 
30 set a reference to this instance, 

assign a name to this instance, 
assign its properties) 
end of while loop 
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return the reference to the toplevel DocumentObjecl, 

the list of purpose parameters and their possible values, 
the list of desired representation levels 



5 Procedure: Reader 

Algorithm: 
call an application-specific interface to: 

- read in current values of purpose parameters 

- read in desired representation levels 

10 - create an instance of WorkingCondition using these values 

Procedure: Main 

Parameter: 
Name of the input datafile 
15 Algorithm: 
call Parse 
call Reader 

while new purpose parameters are input 
call Resolve on the toplevel object 
20 for each specified level of representation 

output the corresponding DocumentObjectSet 
end for 
call Reader 
end of while loop 

25 

The present application has been described with reference to a presently preferred 
embodiment. N4odifications and variations of that embodiment will be apparent to a 
person of skill in the art. Such modifications and variants are believed to be within 
the scope of the present invention as defined in the claims appended hereto. 

30 
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Tl-IE EMBODIMENTS OF THE IN\^ENTION IN WHICH AN EXCLUSIVE 
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 

1 . A computer system for generating custom versions of a master document in 
accordance with user purpose parameters, said system comprising: 

5 (a) a datafile including a data structure specifying said master document; 

(b) a document class library for providing general document class definitions; 

(c) a parser for creating executable instances of said document-class data 
stnictures from said datafile; 

(d) a user input interface for inputting said purpose parameters; and 

10 (e) a selection engine for generating said custom versions of said document from 

said instances of said document class library by utilizing current values of said 
purpose parameters. 

2. A computer system as defined in claim 1, said document including a plurality of 
15 elements and said data structure defining relationships between said elements and 

variations thereof. 

3. A computer system as defined in claim 1 , said instances including a plurality of 
elements and variations each arranged in a hierarchy of predetermined 

20 relationships. 

4. A computer system as defined in claim 3, each element including a resolver for 
selecting from its variations and each variation including a resolver for expanding 
said variation into its component document objects, 

25 

5. A computer system as defined in claim 3, said elements being the components of 
said document structure. 

6. A computer system as defined in claim 1, said instances each including linguistic 
30 infonnation for providing automated grammatical and stylistic correction of said 

customized documents. 

7. A computer readable memory device encoded with a data structure for generating 
custom versions of a master document, the data structure having a plurality of 
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general class objects for describing elements and variations providing, structure 
and content of said master document. 

8. A device as defined in claim 6, said data structure including: 

5 (a) a first data block for identifying user purpose parameters for selecting versions 

of said master document; 

(b) a toplevel object for identifying a document-class instance; and 

(c) a program structure for specifying the form and content of said custom 
versions of said master document. 

10 

9. A device as defined in claim 7, said data structure including an external object for 
describing instances of an external class for defining links to other master 
documents to thereby to create a network of systems for generating custom 
documents. 

15 

10. A device as defined in claim 8, said links being hypertext links. 

1 1 . A method for generating custom versions of a master document implemented in a 
computer system and in accordance w^ith user purpose parameters, said method 

20 comprising the steps of: 

(a) specifying said master document having a datafilc and including a data 
structure; 

(b) providing a document class library having general document class definitions; 

(c) parsing said datafile for creating executable instances of said document-class 
25 data structures; 

(d) a user input interface for inputting said purpose parameters; and using current 
values of said purpose parameters to generate custom versions of said 
document from said instances of said document class library, 

30 12. A method as defined in claim 1 1, including executing said instance for selecting 
from variations of an element and expanding each variation into its component 
document objects, wherein each said instances includes a plurality of elements and 
variations each arranged in a hierarchy of predetermined relationships. 
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Block 1 — Tlie parainetei s: 

PurposeParanieters 

(<feature l>=<list of possible values>)& 
(<featiire 2=<last of possible values>)& 
[Subsequent lines define remaining features] 

RepresentationLevelParameters 

(<list of representation levels to outpxit>) 

Block 2 — Identification of the top-level object: 

toplevel=Document . <nanie of main Docuiiient> 
Block 3 — Definitions of all Documents and their valuations: 

Siib-block: Definilioii of a Document and its variations. 
Field: Definition of the Document. 

Dociunent <naiue of docuiiient> 

I vaxiations=<list of variations of the docuinent>& 
aiinotations=<list of annotat ions> I 
Field: Definitions of the Document's variations. 

Subfield: Definition of a variation of the Document. 

Document Variation <najne of document variation> 
I condition=<Boolean express ion>& 

componentList=<list of components of the document variation><fe 
ajinotations==<list of annotat ions> j 
Subsequent subfields define remaining variations of the Document. 

Subsequent sul>blocks define remaining Documents and their variations. 
Block 4 — Definitions of all Sections and their variations: 

Sub-block: Definition of a Section and its variations 
Field: Definition of the Section. 

Section <naine of section> 

I variations=<lrst of variations of the section>& 
aLnnotations=<list of annotat ions> 1 
Field: Definitions of the Section's variations. 

Subfield: Definition of a variation of the Section, 

Section Variation <nanie of section variation> 
! condition=<Boolean expression>& 

componentList=<list of components of the section variation>& 
ajinotataons=<list of aiinotations> i 
Subsequent subfields define remaining variations of the Section. 

Subsequent sui:>-blocks define remaining Sections and their variations. 
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Block 5 — Definitions of all Topics and their variations: 

Sub-block: DefniiLion of a Topic and its \ariations. 
Field: Definitioti of ihe Topic. 

Topic <najne of topic> 

I variations=<list of variations of the topic>i 
annotations=<list of ajinotat ions> I 
Field: Definitions of Hie Ibpic's variations. 

Suhfie!d: Definiuon of a variation of the Topic. 

Topi cVariat ion <na:ne of topic variatiorj> 
|condition=<Boolean expression>fe 

componentList=<list of components of t:;e Topic variation>& 
annotations=<list of annotations> I 
Subscqucni siibficld.s define reniaiinng variations of Uic Topic 

S\ii)sequcni sul>-blocks define remaining Topics and liieir variaiions. 
Block r> — Definitions of all Sentences and their variations: 

Siib-hlock: Definition of a Sentence and its variations. 
Field: Definition of the Sentence. 

Sentence <name of sentence> 

i vaxiations=<list of variations of £entenc€>4: 
annotations=<list of annotatioiis> I 

Field: Definitions of the Sentence's variations. 

Su!>field: Definition of a variation of the Sentence. 

SentenceVariation <narae of sentence variacion> 
i condition=<Boolean expression>& 
componentList=<list of components>fe 
leV€lList=<last of representation leveis>& 
aiinotations=<list of aiinotations> I 
Subfield: Definitions of the Sentence's representations. 
SentenceRepLevel <name> 

! repLevel=<list of representation levelE>& 
componentList=<a string or list of conipcuentE>& 
2Lnnotations-<list of annotations> i 
Snliseqneni subfields define rematning variations of ilic Senten'-^' and then irpsri-eiaations 

Subsequent sub-blocks dci'iDC remaining Sentences and their \anations 
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Block 7 — Uefiiiitioiis of all Lexicals and tiieir variations: 
Suh-biock: DefiniUon of a Lexical and lis variations. 
Field: Defmitjon of the Lexical. 

Lexical <naine of lexical itein> 

I variations=<list of variations of lexical itein>& 
aiinotations=<list of annotations> I 
Field: Definitions of the Lexical *s variations 

Suhfield: Definition of a variation of the Lexical. 

LexicalVariation <name of lexical variation> 
I condition=<Boolean expression>& 
string=<character string>& 
value=<DocuinentClass> . <ob3ectName>& 
annotations=<list of annotat ions> 1 
Subsequent subfields define remaining variations of the Lexical. 

Siii)sequent bul>blocks define remaining Lexicals and their variations. 

Block 8 — Definitions of all Words: 

Field: Definition of a Word. 

Word <word> 
I val\ie="<word>*' I 

Subsequent fields define remaining Words. 
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Annotation <naine of the Annotation object> 
|<property>=*'<value>"& 
<property>="<vali]e>"& 

<Subsequent lines define remaining properties and their values>& 
parent=<najae of the immediate ancestor of this Annotation object>| 
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External <ncLine of the External object> 
If ileNajne="<f lie najne>"& 

prof ile=<list of parameters describing current user/audience> 
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European Patent Office 

Erhardstrasse 27 
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Germany 

Attention: G. Barraco, Examiner 



This is in response to the First Written Opinion dated May 21, 1999. 

The Examiner has indicated that claims 1 to 12 lack inventive step in view of 
reference Dl: C. Dimarco et al.: *The automated generation of Web documents that are 
tailored for the reader". 

The Examiner has indicated that the customization of documents for different 
audiences and purposes is known in the art as indicated and implementing such a method in 
an object oriented approach is obvious. 

The Applicant has carefully reviewed the Examiner's statements and the Dl reference. 
Applicant respectfully believes that the Examiner's grounds for rejecting the claims are 
unfounded, particularly in light of the following comments. 

The present invention defines a computer system and method for generating custom 
versions of a master document. The master document is defined by a data file including a 
data structure which describes the content of the master document, a document class library 
which contains generic descriptions for the various versions of the document, a parser which 
reads both the data file and the document class library to create executable instances of the 
document and a selection engine which uses purpose parameters to generate custom versions 



RESPONSE 



Dear Sir: 



2 

of the document from the various instances of the executable document as generally defined 
in independent claims 1 and 11. 

On the contrary the Dl reference describes a system for customizing documents 
wherein the master document is a script file that has to be run each time the purpose 
parameters are changed. There is no hint of suggestion in Dl of having a document class 
library containing generic descriptions. In Dl, the data file which is the master document is 
reread whenever new purpose parameters are provided by the user. There is no concept of 
instantiation in Dl as required and defined in the claims of the present invention. 

Furthermore in Dl there is no suggestion of a selection engine, a parser or an object 
orientated class library. The process of instantiation is important in the present invention in 
that it provides a "program" that holds its state which is then subsequently processed by a 
selection engine for producing the customized document in response to the user purpose 
parameters. On the other hand, in Dl, the master document is a script file that has to be run 
each time the purpose parameters are changed. Thus there is no disclosure of a generic 
master document which is content independent. The Dl reference defines merely instances 
of a master document which is content dependent. 

The Examiner has indicated that it is "obvious, when implementing the program 
design of such methods, to adopt the object or approach according to now a days trends". 
The Applicant has carefully reviewed the Dl reference and can find no suggestion or 
indication of adopting an object orientated approach to create customized documents based 
on the data file of Dl. The Examiner has also failed to provide any other indication of the 
state of the art regarding object oriented approaches with regard to customized document 
generation. Thus it is believed that the Examiner's objection is not well founded and is 
merely the result of hindsight analysis. 

For the above reasons it is believed that the objections to independent claims 1 and 1 1 
are overcome and consequently the objections to the dependent claims 2 to 10 and 12 are also 
overcome. 
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Reconsideration of the claims on their merits is courteously requested. 



Respectfully submitted, 
Orange CH ARI PILLAY 
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